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Abstract. An infinite word has the property Rm if every factor has exactly m return words. 
Vuillon showed that R2 characterizes Sturmian words. We prove that a word satisfies Rm if its 
complexity function is (m — l)n + l and if it contains no weak bispecial factor. These conditions 
are necessary for m = 3, whereas for m = 4 the complexity function need not be 3n + 1. New 
examples of words satisfying Rm are given by words related to digital expansions in real bases. 



1. Introduction 

Recently, return words have been intensively studied in (symbolic) dynamical systems, com- 
binatorics on words and number theory. Roughly speaking, for a given factor w of an infinite 
word w, a return word of w is a word between two successive occurrences of the factor w. This 
can be seen as a symbolic version of the first return map in a dynamical system. This notion 
was introduced by Durand [5] to give a nice characterization of primitive substitutive sequences. 
A slightly different notion of return words was used by Ferenczi, Mauduit and Nogueira [5]. 

Sturmian words are aperiodic words over a biliteral alphabet with the lowest possible factor 
complexity; they were defined by Morse and Hedlund [TT]. Using return words, Vuillon [T3] found 
a new equivalent definition of Sturmian words. He showed that an infinite word u over a biliteral 
alphabet is Sturmian if and only if any factor of u has exactly two return words. A short proof of 
this fact is given in Section [5l 

A natural generalization of Sturmian words to m-letter alphabets is constituted by infinite 
words with every factor having exactly m return words. This property is called Rm- It covers other 
generalizations of Sturmian words: Justin and Vuillon [lOj proved that Arnoux-Rauzy words of 
order m satisfy Vuillon [1A\ proved this property for words coding regular m- interval exchange 
transformations. 

The factor complexity, i.e., the number of different factors of length n, of the two classes of 
words with property in the preceding paragraph is (m — l)n + 1 for all n > 0. Vuillon [14j 
observed that this condition is not sufficient to describe words satisfying i?„j, to > 3: the fixed 
point of a certain recoding of the Chacon substitution, which has complexity 2n + 1 by Ferenczi [6] , 
has factors with more than 3 return words. 

A deeper inspection of the two classes of words with property R,n shows that not only the 
first difference of complexity is constant, but also that the bilateral order of every factor (see 
Cassaigne [4 and Section 2]) is zero. We show that this condition is indeed sufficient to have the 
property Rm , and provide a less known class of words satisfying this condition. If a word satisfies 
i?3, then we can show that no factor is weak bispecial, i.e., no factor has negative bilateral order. 
Therefore the words with i?3 are characterized by complexity 2n + 1 and the absence of weak 
bispecial factors. 

In Section fe.il we provide a word satisfying R-^ with an even number of factors of every positive 
length (containing infinitely many weak bispecial factors). Therefore words satisfying Rm do not 
necessarily have complexity (to — l)n + 1, and it is an open question whether there exists a nice 
characterization of words satisfying Rm for m > 4. 

We conclude the article by exhibiting a large class of purely substitutive words satisfying Rm- 
Every word in this class codes the sequence of distances between consecutive /3-integers for some 
real number (3 > 1- 
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In this article we focus only on the number of return words corresponding to a given factor of 
an infinite word. We do not study the ordering of return words in the infinite word, i.e., we do 
not study derivated sequences (see [5] for the precise definition). Let us just mention here that a 
derivated sequence of a word with property Rm is again a word satisfying R^- A description of 
derivated sequences of Sturmian words can be found in [1] . 

2. Basic definitions 

An alphabet ^ is a finite set of symbols called letters. A (possibly empty) concatenation of 
letters is a word. The set A* of all finite words provided with the operation of concatenation is 
a free monoid. The length of a word w is denoted by \w\. A finite word w is called a factor (or 
subword) of the (finite or right infinite) word u if there exist a finite word v and a word v' such 
that u — vwv' . The word w is a prefix of u if w is the empty word. Analogously, w is a, suffix of u 
if v' is the empty word. A concatenation of k words w will be denoted by . 

The language C(u) is the set of all factors of the word u, and Cn{u) is the set of all factors of 
u of length n. Let w be a factor of an infinite word u and let a, 6 G ^. If aw is a factor of w, then 
we call a a left extension of w. Analogously, we call b a right extension of w if wb S C{u). We 
will denote by £t{w) the set of all left extensions of w, and by £r{w) the set of right extensions. 
A factor w is left special if ^£e{w) > 2, right special if 41=£r{w) > 2 and bispecial if w is both left 
special and right special. 

Let w be a factor of an infinite word u = uqUi ■ ■ ■ (with Uj A), Iwl — £. An integer j is called 
an occurrence of w in u if UjUj+i • • -Uj+i^i — w. Let j, fc, j < k, be successive occurrences of w. 
Then UjUj^i ■ ■ ■ Uk-i is a return word of w. The set of all return words of w is denoted by TZ{w), 

TZ(w) — {ujUj+i . . . Uk^i I J, k being successive occurrences of w in u}. 

If w is a return word of w, then the word vw is called complete return word. 

An infinite word is recurrent if any of its factors occurs infinitely often or, equivalently, if any 
of its factors occurs at least twice. It is uniformly recurrent if, for any n G N, every sufficiently 
long factor contains all factors of length n. It is not difficult to see that a recurrent word on a 
finite alphabet is uniformly recurrent if and only if the set of return words of any factor is finite. 

The variability of local configurations in u is expressed by the factor complexity function (or 
simply complexity) C{n) = ^Cn{u). It is well known that a word u is aperiodic if and only if 
C (n) > n + 1 for all n G N. Infinite aperiodic words with the minimal complexity C (n) = n + 1 
for all n e N are called Sturmian words. These words have been studied extensively, and several 
equivalent definitions of Sturmian words can be found in Berstel [3]. 

3. Simple facts for return words 

3.1. Restriction to bispecial factors. If a factor w is not right special, i.e., if it has a unique 
right extension b £ A, then the sets of occurrences of w and wb coincide, and 

TZ{w) = n{wb). 

If a factor w has a unique left extension a G then j > 1 is an occurrence of w in the infinite word 
u if and only if j — 1 is an occurrence of bw. This statement does not hold for j — 0. Nevertheless, 
if u is a recurrent infinite word, then the set of return words of w stays the same no matter whether 
we include the return word corresponding to the prefix w of m or not. Consequently, we have 

TZ{aw) — aTZ{w)a^^ — {ava^^ \ v G TZ{w)}, 

where ava~^ means that the word v is prolonged to the left by the letter a and it is shortened 
from the right by erasing the letter a (which is always the suffix of v for v G TZ{w)). 

For an aperiodic uniformly recurrent infinite word u, each factor w can be extended to the left 
and to the right to a bispecial factor. To describe the cardinality and the structure of TZ{w) for 
arbitrary w, it suffices therefore to consider bispecial factors w. 
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Figure 1. The tree of return words of 01 in the Thue-Morse sequence and its trie representation. 

3.2. Tree of return words. It is convenient to consider a tree (or trie) constructed in the 
following way: Label the root with a factor w, and attach ^£r{w) children, with labels wb, 
b G £r{w). Repeat this recursively with every node labeled by v, except if w is a suffix of v. If u 
is uniformly recurrent, then this algorithm stops, and it is easy to see that the labels of the leaves 
of this tree are exactly the complete return words of w. Therefore we have 

(1) #7eH = #{leaves} = 1 + ^ 1). 

non-lcavcs v 

In particular, if w is the unique right special factor of its length, then ^TZ{w) = 4j^£r{w). 

A similar construction can be done with left extensions, yielding similar formulae. Since we can 
restrict our attention to bispecial factors w by Section [3.11 we obtain the following proposition. 

Proposition 3.1. Let u he a recurrent word and m G N. Suppose that for every n G N at least 
one of the following conditions is satisfied: 

• There is a unique left special factor w G Cn{u), and 4i^£i{w) — m. 

• There is a unique right special factor w G Cn{u), and ^£r{w) — m. 
Then u satisfies property Rm, i-S-, every factor has exactly m return words. 

Recall that Arnoux-Rauzy words of order m are defined as uniformly recurrent infinite words 
which have for every n G N exactly one right special factor w of length n with ^£r{w) = m 
and exactly one left special factor w of length n with ^£e{w) = m. They are also called strict 
episturmian words. It is easy to see that Sturmian words are recurrent, and we obtain the following 
corollary to Proposition 13. II 

Corollary 3.2. Arnoux-Rauzy words of order m satisfy Rm, in particular Sturmian words satisfy 

4. Sufficient conditions for property R„i 

This section is devoted to sufficient conditions for a word u having the property Rm, but we 
mention first two evident necessary conditions. 

The alphabet ,4 of u must have m letters since the occurrences of the empty word are all 
integers n > 0, and its return words are therefore all letters w„. Furthermore, u must be uniformly 
recurrent since every factor has a return word and only finitely many of them. 

An important role in our further considerations is played by weak bispecial factors. 

Definition 4.1. A factor w of a recurrent word is weak bispecial if B{w) < 0, where 

B{w) = #{awb G jC{u) \ a,b e A} ~ #£tiw) - #£riw) + 1 

is the bilateral order ofw. 

Since #{awb G C{u) \ a,b e A} ^ J2ae£e(.w) "f^^riaw) = J2b££^{w) #^dwb), the inequality 
B{w) < is equivalent to 

J2 mr{aw) - 1) < i^£r{w) - 1 
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and to 

5^ mdwh) - 1) < #E,{yo) - 1. 

The bilateral order was defined by Cassaigne [4J in order to calculate the second complexity 
difference. If we set AC(n) = C{n + 1) — C(n), then we have 

AC(n)= (#^^H-1)- E (#^rH-l) 

and therefore 

AC{n + l)^ACin)= ^ ^ (#f,(aw) - I) - ^ (#f,H - I) 

= J2 (#{«^^ ^ '^(") I a, e -4} - - + 1) = E 5(10). 

If i3(u') = for all factors then the first complexity difference is constant. If no factor is 
weak bispecial, then AC(n) is non-decreasing. Since AC(0) = #yl — 1 and #A = m, we obtain 
the following lemma. 

Lemma 4.2. If u satisfies i?,„ and no factor is weak bispecial, then AC{n) > to— 1 for all n> 0. 

The number of return words can be bounded by the following lemmas. 
Lemma 4.3. If u is a uniformly recurrent word with no weak bispecial factor, then 

#n{w) >l + AC{\w\) 

for every factor w € C{u). 

Proof. Let w G C{u) and denote by fi, ■i;2, ■ ■ • , the right special factors of length \w\. Since no 
factor is weak bispecial and u is uniformly recurrent, every Vj can be extended to the left without 
decreasing the total amount of "right branching" until w is reached. More precisely, we have 
(mutually different) right special factors ''^^P ■ • • • ' ^'f'^ with suffix Vj, prefix w and no other 
occurrence of w such that #£r{vj) — 1 < J^iLiiif^riVj^^) — !)■ Since all wj*^ are nodes in the tree 
of return words and -wj'^ ^ Vj] ^ if (j, i) ^ (j', i'), we can use (P) and obtain 

#n{w) >l + Y, E(#£,.(^;f ) - 1) > 1 + Y,{#£r{v,) - 1) = I + AC{\w\). □ 
j=i 1=1 j=i 

Lemma 4.4. If u has no weak bispecial factor and AC{n) < to for all n>Q, then 

#TZ{w) < TO 

for every factor w e C{u). 

Proof. Let vi,V2, . ■ . ,Vr denote the right special factors which are labels of non- leave nodes in 
the tree of return words of w, and n = maxi<j<,. \vj\. Since no bispecial factor is weak, every Vj 
can be extended to the left to factors of length n without decreasing the total amount of "right 
branching". More precisely, we have (mutually different) right special factors v'"j^\v'^^\ . . . 
of length n with suffix Vj such that f/=£r{vj) — 1 < '^iLiiif^ri'Vj^'^) — 1). Since w occurs in Vj only 
as prefix, no Vj can be a proper suffix of Vj>. Hence we have uj'"' ^ Vjl if {j, i) ^ (j', i') and 

r r 

#n{w) = 1 + E {4£r{vj) - 1) < 1 + E E {*^r(vf) - 1) < 1 + AC{n) < m. □ 

j=l J=l 1=1 

For words with no weak bispecial factors, these three lemmas give a very simple characterization 
of the property Rm- 

Theorem 4.5. If u is a uniformly recurrent word with no weak bispecial factor, then it satisfies 
Rm if and only if C{n) — (m — l)n + 1 for all n > 0. 
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5. Properties R2 and 

For m = 2 and m = 3, we can completely characterize the words with property Rm- 

Definition 5.1. Let v he a return word of w Cz C{u). We say that return word v starts with b if 
wb is a prefix of the complete return word vw and that it ends with a if aw is a suffix ofvw. 

A right special factor w is called maximal right special if w is not a proper suffix of any right 
special factor, i.e., J2a<£Se{w)i'ff'^r{ciw) — 1) = 0. Any maximal right special factor is therefore 
weak bispecial. 

Lemma 5.2. If w G 'C(u) is a maximal right special factor such that for any b G £r{w) there 
exists a unique v G TZ{w) starting with b, then u is eventually periodic. 

Proof. Denote the return words of w hy vi,V2, . . . ,Vr, where, w.l.o.g., Vj starts with hj, ends with 
Oj and bj+i is the only letter in £r{o-jw) for 1 < j < r. Then 61 is the only letter in Srio-rw) and 
u = p{viV2 ■ ■ ■ Wr)°° for some prefix p. □ 

Corollary 5.3. If u satisfies R2, then it has no maximal right special factor. 

Proof. Assume that w is a maximal right special factor. Then the two return words of w have 
different starting letters, hence u is eventually periodic by Lemma [5?2l and fj^TZ{wa) = 1. □ 

On a binary alphabet, the notions "weak bispecial" and "maximal right special" coincide. 
Therefore Corollaries 13.21 15.31 and Lemma 14.31 provide a short proof of the following theorem. 

Theorem 5.4 (Vuillon (13]). An infinite word u satisfies R2 if and only if it is Sturmian. 

For words with property i?3, we need the following lemma. 

Lemma 5.5. Let w be a weak bispecial factor with a unique a G £e{w) such that more than one 
return word of w starts with a letter in £r{aw), then =fl=IZ(aw) < ^TZ{w). 

Proof. Any return word of aw has the form aviV2 ■ ■ ■ Vj-a^^ for some r > 1 and Vj G TZ{w), 
^ ^ j l£ f- If vi ends with a, then r = 1. If ui ends with a' ^ a, then the assumption of the 
lemma implies that there is a unique return word of w starting with a letter in £r{a'w) (and 
^£r{a'w) = 1). Therefore V2 and inductively the sequence of words V2,...,Vr are completely 
determined by the choice of vi. This implies that ^TZ{aw) equals the number of return words of 
w starting with a letter in ^£r{aw). Since w is weak bispecial, we have ^£r{aw) < ^£r{w) and 
thus i^n{aw) < #n{w). □ 

Remark. There are two cases for Lemma 15.51 Either aw is right special or there is more than one 
return word of w starting with the unique right extension of aw. 

Corollary 5.6. If u satisfies R:^, then it has no weak bispecial factor. 

Proof. Assume that w is a weak bispecial factor. 

If 4j^£r{w) — 3, then every return word of w starts with a different letter in £r{w). Since at most 
for one a G £e{w), the factor aw is right special, we obtain a contradiction to R^ by Lemma 15.21 
or 15.51 

If ^£r{w) ~ 2, then £r{aw) = {6} and £r{a'w) = {&'}• Since, w.l.o.g., two return words of w 
start with b and one starts with b' , we obtain a contradiction to R^ by Lemma 15.51 □ 

By combining Corollarv l5.6l and Theorem 14. 5i we obtain the following theorem. 

Theorem 5.7. A uniformly recurrent word u satisfies R^ if and only if C{n) = 2n + 1 for all 
n>Q and u has no weak bispecial factor. 

Remarks. 

• The theorem remains true if "weak bispecial" is replaced by "maximal right special" : If 
AC{n) = 2 for all n > 0, then every factor w with ^£r{w) = 3 is the unique right special 
factor of its length, and it cannot be weak bispecial. If = 2, then the two notions 

coincide. 
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• By symmetry, "weak bispecial" can be replaced by "maximal left special" . 

• The condition on weak bispecial factors cannot be omitted. Ferenczi 6 showed that the 
fixed point cr°°{l) of the substitution given by cr : 1 12, 2 i-^ 312, 3 i-^ 3312, a receding 
of the Chacon substitution, has complexity 2n + 1 and it contains weak bispecial factors. 

6. Property R4 

6.1. A word with complexity 7^ 3n+ 1. The following proposition shows that C{n) need not 
be (m — l)n + 1 for all n > if m satisfies Rm- 

Proposition 6.1. Define the substitution a by 

cr : 1 13231 

2 ^ 13231424131 

3 ^ 42324131424 

4 ^ 42324 

Then the fixed point (T°°(1) satisfies R^. 

Proof. By Section [3.11 it is sufficient to consider bispecial factors of u = (t°°{1). The factors 
of length 2 are C2iu) = {13,14,23,24,31,32,41,42}. For the bispecial factors 1,2,23,2413, the 
return words are easily determined: 

n{l) = {13, 1323, 1424, 142324} 
n{2) = {23, 2314, 2413, 241314} 
7^(23) = {2314,2314241314,232413,232413142413} 
7e(2413) = {241314,24131423,24132314,2413231423} 

The language of u is closed under the morphism ip defined by 93 : 1 <-> 4, 2 ^ 3, since <7ip{w) = 
if(j{w) for all factors w. Therefore we have TZ{(p{w)) = (p{TZ{w)). 

The only factors of the form alb, a,b G A are 314 and 413, hence 1 is a weak bispecial factor, 
and 1,4 are the only bispecial factors with prefix or suffix 1 or 4. Similarly, 23 and 32 are weak 
bispecial factors and no other bispecial factor has prefix or suffix 23 or 32. 

The return words of the weak bispecial factor 2413142 are factors of <t{v), with a factor v of 
length lij] > 2 having prefix 2 or 3, suffix 2 or 3 and no other occurrence of 2 and 3. Since the 
only possibilites for v are 23, 2413, 32, 3142, we obtain 

7^(2413142) = {24131423, 241314232413231423, 24131424132314, 241314241323142324132314}. 

All remaining bispecial factors w have prefix 24132 or 31423 and suffix 23142 or 32413, and 
therefore a decomposition w — ta{v)t' with t G {24,31}, t' G {1323142,4232413} and a unique 
bispecial factor v. If v is empty, then we have w.l.o.g. w = 241323142 and 

n{w) = {2413231423, 2413231423241314, 2413231424131423, 24132314241314232413242}. 

If V is not empty, then the uniqueness of v implies that the set of complete return words of w 
is ta{TZ{v)v)t' . Since v is shorter than w, we obtain inductively that all bispecial factors have 
exactly 4 return words. □ 

6.2. Weak bispecial factors. The preceding example shows that weak bispecial factors cannot 
be excluded in words u satisfying R4. Nevertheless, we can show that the existence of a weak 
bispecial factor imposes strong restrictions on the structure of the word u. 

Lemma 6.2. Let w be a weak bispecial factor of a word u satisfying R4. Then there exist factors 
Wi,W2 S Aw U wA and wi , U2 , f 3 , such that 

(2) 'R-iwi) = {V1W3, ■i;2W4} andTZ{w2) = {w3'yi, '!'3'y2, V4z;i, i;4z;2}. 

Proof. Let ui be a weak bispecial factor. Since the problem is symmetric, assume, w.l.o.g., 
4j^£r{w) > We have three different situations: 
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• 4t^£r{w) — 2: We have £r{aw) ~ 1 for all a e Siiw). For both b E Sriw), there must be 
more than one return word of w starting with b by Lemma 15.51 Therefore, there exists 
two return words of w starting with each b. Let wi = wbi, W2 = wb2. 

• 4f^£r{w) = 3: There exists a unique b G £r{w) such that two return words of w start with b. 
By Lemma 15.51 there exists therefore some a S £e{w) such that ^£r{aw) > 2. Since w 
is weak bispecial with =l^£r{w) = 3, we have 4l^£r{aw) = 2 and £r{a'w) = 1 for all other 
a' £ £i{'w). Let wi = aw, W2 = wb. 

• ^£r{w) — 4: For every b £ £r(w), there is a unique return word of w starting with b. By 
Lemma [5.51 we have ai, a2 & £t.{w) with 4t^£r(aiw) — 2. Let wi = aiw, W2 = a2W. 

Consider "complete return words of the set {'Wi,W2}" ■ words which have either wi or W2 as 
prefix, either wi or W2 as sufhx, and no other occurrence of wi and W2- By the definitions of wi 
and W2, there are exactly two such words viWi-^,V2Wi2 with prefix wi and two words V3Wi.^,ViWi^ 
with prefix W2- 

li ii — i2 — 2 and — {4 = 1, then R4 implies that ([2]) holds. 

If ii — 12 — 1, then wi has only the two return words vi,V2. If 12 = is = H = 1, then the return 
words of wi are viV3,viV4,V2- Similarly, ^3 = 14 = 2 and ii — i2 — is = 2 are not possible. 

The only remaining case is ii = ~ 1, i2 — is ^ 2. Then the return words of wi are vi and 
V2V^'V4, i G {1,2,3}, < ri < r2 < ^3. The return words of W2 arc V3 and W4W^'W2, « S {li2,3}, 

< Si < S2 < S3- 

The return words of ^2^2 are therefore of the form V2vJ^^V4vl\ Let Si be the set of these 4 
pairs {ri,Sj). Similarly, let S2 be the set of the 4 pairs {si,rj) such that V4vl^V2V^^ is a return 
word of V4W1. 

We show that there must be some i G {1,2,3} such that {ri,S2) G and (^^,53) e Si, by 
considering the return words of vl^wi and of wJ^W2W2- The return words of vl'^V2W2 are of the form 
vl^V2tvl"V4vl'~'''' with t G (?;3W4wfi?;2)*, « G {1, 2, 3} and j G {2,3}. For these t and r^, vl^V2tvl"V4 
is a return word of uj^wi. If there was no r.; with (r.^, S2) G •S'l and (r^, S3) G 6*1, then these words 
would provide 4 different return words of v^^wi, wich contradicts R4 since vi is another return 
word. 

Similarly, we must have some i G {1,2,3} such that (si,r2) G S2 and (si,r3) G 82- By 
considering the return words of vl^wi and V4vl'^wi, we obtain as well the existence of some 

1 G {1, 2, 3} such that (r2, s^) G 5*1 and (^3, s^) G Si. Finally, we must also have some i G {1, 2, 3} 
such that {s2,ri) G S2 and (s3,ri) G 82- 

Putting everything together, we have two possibilities for S'l. Either it contains (ri, si) and no 
other {ri, Sj) with j = 1 or j = 1, or S'l = {(ri, S2), (ri, S3), (r-2, si), (r^, si)}. Analogously, S2 con- 
tains (si, ri) and no other (si, r^) with i — I oi j — 1, or ^2 — {(si, r2), (si, r3), (s2, ri), (S3, ri)}. 

If (ri,si) G Si and (si,ri) G ^2, then V2V^^V4Wi has only one return word, V2V^^V4vl^ . 
If (ri,si) ^ S'l and (si,ri) ^ S2, then V2V^^V4Wi has only two return words, V2V^^V4Vi^ and 
t;2U3^W4Wi^. If (?'i,si) G S'l and (si,ri) ^ S2, then the return words of V2V^^V4Wi are of the form 
V2V^^V4v'{^V2V^^V4V-^' with {ri,Sj) G Si \ {(ri,si)}, thus there are only three words. Similarly, 
V4vl^V2'W2 has only three return words if (ri,si) S'l and (si,ri) G S'2. 

This shows that ii — 14 — 1, 12 ~ is — 2 is impossible, and the lemma is proved. □ 



7. Words associated with /3-integers 

In this section, we describe a new class of infinite words with the property Rm- The language 
of these words is not necessarily closed under reversal. 

Consider the fixed point u = a°° (0) of a primitive substitution of the form 

a: 0*n 

1 ^ 0*^2 

(3) : 

m-2 ^ 0*'"-i(m-l) 



m — 1 I— > 
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for some integers m > 2, ti,t„i > 1 and ^2, ■ ■ • , ^m-i ^ 0. The incidence matrix of cr is a companion 
matrix of the polynomial a;™ — — • • • — tm- Let /3 > 1 be the dominant root of this polynomial 
(the Perron- Frobenius eigenvalue of the matrix). If 



where ^ denotes the lexicographic ordering, then cr is a (3 -substitution and /3 is a simple Parry 
number. It is easy to see that u codes in this case the sequence of distances between consecutive 
nonnegative /3-integers 



and a letter k corresponds to the distance tk+i/ P+ ■ ■ ■ + tm/ (3'^ (0 corresponds to distance 1.) 

Remark. The most prominent example of a /3-substitution is the Fibonacci substitution (to = 2, 
ti — t2 — \) , where j3 is the golden mean. 

It is not difficult to show that all prefixes of u are left special factors, with all to letters being 
left extensions (see e.g. Frougny, Masakova and Pelantova [9j). For every factor w, the tree of 
return words constructed by the left extensions (see Section 13. 2p contains therefore a node with 
TO children, the shortest prefix of u having w as suffix. The word u is uniformly recurrent since all 
fixed points of primitive substitutions have this property (Queffelec [121). Therefore every factor w 
has at least to return words. If there exists a left special factor which is not a prefix of u, then this 
factor has more than to return words. By Proposition [3Tll we obtain the following proposition. 

Proposition 7.1. If u = a°°{0) for some substitution a of the form (0], then it satisfies Rm if 
and only if C{n) = (m — l)n + 1 for all n > 0. 

Bernat, Masakova and Pelantova [2] characterized the fixed points of /3-substitutions satisfying 
AC(n) = TO — 1 for all n > 0. The techniques of their proof can also be used to construct non- prefix 
left special factors if cr is a substitution of the form ^ which is not a /3-substitution, and their 
conditions can be reformulated as in the following corollary. 

Corollary 7.2. If u = cr°°(0) for some substitution a of the form (0), then it has the property 
Rm if and only if 

• tjyi = ^ and 

• ij ' ' ' tm~iti ■ ■ ■ tj-i ^ ii • • • trn-i for all j £ {2, . . . , TO — 1}. 

Note that the language of u is closed under reversal if and only if ti ^ t2 — ■ ■ ■ — t^-i- In this 
case, u is an Arnoux-Rauzy word of order m. 
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